Forward Selection Procedure for Linear Model Building Using Spearman’s Rank Correlation
نویسندگان
چکیده
Forward selection (FS) is a step-by-step model-building algorithm for linear regression. The FS algorithm was expressed in terms of sample correlations where Pearson’s product-moment correlation was used. The FS yields poor results when the data contain contaminations. In this article, we propose the use of Spearman’s rank correlation in FS. The proposed method is called FSr. We conduct an extensive simulation study to compare the performance of FSr with FS. The proposed FSr performs better than the FS algorithm in the contaminated data. We also demonstrate a real data application of FSr.
منابع مشابه
COMPARISON OF VALUES OF PEARSON’S AND SPEARMAN’S CORRELATION COEFFICIENTS ON THE SAME SETS OF DATA jan hauke, tomasz kossowski
Spearman’s rank correlation coefficient is a nonparametric (distribution-free) rank statistic proposed by Charles Spearman as a measure of the strength of an association between two variables. It is a measure of a monotone association that is used when the distribution of data makes Pearson’s correlation coefficient undesirable or misleading. Spearman’s coefficient is not a measure of the linea...
متن کاملMultivariate Spearman’s rho for rank aggregation
We study the problem of rank aggregation: given a set of ranked lists, we want to form a consensus ranking. Our main contribution is the derivation of a nonparametric estimator for rank aggregation based on multivariate extensions of Spearman’s ρ, which measures correlation between a set of ranked lists. Multivariate Spearman’s ρ is defined using copulas, and we show that the geometric mean of ...
متن کاملMultivariate Spearman’s ρ for Aggregating Ranks Using Copulas
We study the problem of rank aggregation: given a set of ranked lists, we want to form a consensus ranking. Furthermore, we consider the case of extreme lists: i.e., only the rank of the best or worst elements are known. We impute missing ranks and generalise Spearman’s ρ to extreme ranks. Our main contribution is the derivation of a non-parametric estimator for rank aggregation based on multiv...
متن کاملSpeedy Model Selection (SMS) for Copula Models
We tackle the challenge of efficiently learning the structure of expressive multivariate realvalued densities of copula graphical models. We start by theoretically substantiating the conjecture that for many copula families the magnitude of Spearman’s rank correlation coefficient is monotonic in the expected contribution of an edge in network, namely the negative copula entropy. We then build o...
متن کاملMeasurements to determine the ranking accuracy of perceptual models
Linear regression is commonly used in the audio industry to create objective measurement models that predict subjective data. For any model development, the measure used to evaluate the accuracy of the prediction is important. The most common measures assume a linear relationship between the subjective data and the prediction, though in the early stages of model development this is not always t...
متن کامل